On Ambiguity Detection and Postprocessing Schemes using Cluster Ensembles
Authors
Abstract
In this paper, we explore the cluster ensemble problem and propose a novel scheme to identify uncertain or ambiguous regions in the data based on the different clusterings in the ensemble. In addition, we analyse two approaches to dealing with the detected uncertainty. The first and simplest method is to ignore ambiguous patterns before applying the ensemble consensus function, thus preserving the non-ambiguous data as good “prototypes” for any further modelling. The second is to use the ensemble solution obtained by the first method to train a supervised model (a support vector machine), which is then applied to reallocate, or “recluster”, the ambiguous patterns. A comparative analysis of the different ensemble solutions and the base weak clusterings was conducted on five data sets: two artificial mixtures of five and seven Gaussians, and three real data sets from the UCI machine learning repository. The experimental results show that, in general, our proposed schemes perform better than the standard ensembles.
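The first approach described in the abstract can be illustrated with a minimal Python sketch. The abstract does not state the ambiguity criterion, so the one used here is an assumption: a pattern is treated as ambiguous when a large share of its co-association values (the fraction of base clusterings that place two patterns in the same cluster) fall in an uncertain middle band. The function names `coassociation` and `ambiguity_scores`, the band limits `low` and `high`, and the `threshold` are all hypothetical choices, not taken from the paper. The supervised reallocation step is not shown.

```python
import numpy as np

def coassociation(labelings):
    """Fraction of base clusterings in which each pair of patterns shares a cluster."""
    labelings = np.asarray(labelings)                 # shape: (n_clusterings, n_patterns)
    same = labelings[:, :, None] == labelings[:, None, :]
    return same.mean(axis=0)                          # shape: (n_patterns, n_patterns)

def ambiguity_scores(coassoc, low=0.25, high=0.75):
    """Assumed criterion: share of the other patterns whose co-association lies in (low, high)."""
    n = coassoc.shape[0]
    uncertain = (coassoc > low) & (coassoc < high)    # the diagonal is 1.0, so it never counts
    return uncertain.sum(axis=1) / max(n - 1, 1)

# Toy ensemble: three base clusterings of six patterns (labels are arbitrary per clustering).
labelings = [
    [0, 0, 0, 1, 1, 1],
    [1, 1, 1, 0, 0, 0],   # same partition as the first, with the labels swapped
    [0, 0, 1, 1, 1, 1],   # pattern 2 moves to the other group
]

C = coassociation(labelings)
scores = ambiguity_scores(C)
threshold = 0.5                                       # hypothetical cut-off
mask = scores > threshold                             # True marks an ambiguous pattern

ambiguous = np.flatnonzero(mask)
prototypes = np.flatnonzero(~mask)                    # patterns kept for the consensus step
print(ambiguous)                                      # [2]
```

Because the co-association matrix compares patterns pairwise, it is unaffected by the arbitrary label names that each base clustering assigns. In this toy ensemble only pattern 2 is flagged, and the remaining patterns would be passed to whichever consensus function is used.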
Similar resources
Diversity-Based Weighting Schemes for Clustering Ensembles
Clustering ensembles have recently been recognized as an emerging approach to providing more robust solutions to the data clustering problem. Current clustering ensemble methods typically fall into instance-based, cluster-based, or hybrid approaches; however, most such methods fail to discriminate among the various clusterings that participate in the ensemble. In this paper, we address th...
A Novel Method for Detecting Targets on Inactive Radars Using an Adaptive Processing on the Ambiguity Function (RESEARCH NOTE)
In this paper, a novel method for detecting targets in inactive radars is presented. In this method, the time history of the cells of the ambiguity function is used for detection. For this purpose, the cell history is considered as a random field. Then, using an adaptive filter, the string time of the desired target is separated from the string time of noise and clusters in the environment. In order to...
Using Diversity in Preparing Ensembles of Classifiers Based on Different Feature Subsets to Minimize Generalization Error
It is well known that ensembles of predictors produce better accuracy than a single predictor provided there is diversity in the ensemble. This diversity manifests itself as disagreement or ambiguity among the ensemble members. In this paper we focus on ensembles of classifiers based on different feature subsets and we present a process for producing such ensembles that emphasizes diversity (am...
Unsupervised Emotional Scene Detection from Lifelog Videos Using Cluster Ensembles
An emotional scene detection method is proposed in order to retrieve impressive scenes from lifelog videos. The proposed method is based on facial expression recognition, considering that a wide variety of facial expressions could be observed in impressive scenes. Conventional facial expression techniques, which focus on discriminating typical facial expressions, will be inadequate for lifelog vi...
Relating ensemble diversity and performance: A study in class noise detection
The advantage of ensemble methods over single methods is their ability to correct the errors of individual ensemble members and thereby improve the overall ensemble performance. This paper explores the relation between ensemble diversity and noise detection performance in the context of ensemble-based class noise detection by studying different diversity measures on a range of heterogeneous noi...
Publication date: 2010